An architecture for parallel topic models

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Architecture for Parallel Topic Models

This paper describes a high performance sampling architecture for inference of latent topic models on a cluster of workstations. Our system is faster than previous work by over an order of magnitude and it is capable of dealing with hundreds of millions of documents and thousands of topics. The algorithm relies on a novel communication structure, namely the use of a distributed (key, value) sto...

متن کامل

Scalable Parallel Topic Models

U) The topic model is a popular probabilistic model for text and document modeling. It can be used for topic indexing, document classification, corpus summarization and information retrieval. In the past, topic models have been applied to corpora containing thousands to hundreds of thousands of documents. Now there is an increasing need to model collections with millions to billions of document...

متن کامل

Model-Parallel Inference for Big Topic Models

In real world industrial applications of topic modeling, the ability to capture gigantic conceptual space by learning an ultra-high dimensional topical representation, i.e., the so-called “big model”, is becoming the next desideratum after enthusiasms on ”big data”, especially for fine-grained downstream tasks such as online advertising, where good performances are usually achieved by regressio...

متن کامل

Communication-Free Parallel Supervised Topic Models

In this project, we develop a parallel algorithm for supervised latent Dirichlet allocation (sLDA) Mcauliffe & Blei (2008) which maintains the speed advantage of communication free parallel computing in Neiswanger et al. (2013) while at the same time bypassing the problematic quasiergodicity in the local posteriors combination stage. Since the main objective of sLDA is prediction rather than me...

متن کامل

The Topic Browser An Interactive Tool for Browsing Topic Models

Topic models have been shown to reveal the semantic content in large corpora. Many individualized visualizations of topic models have been reported in the literature, showing the potential of topic models to give valuable insight into a corpus. However, good, general tools for browsing the entire output of a topic model along with the analyzed corpus have been lacking. We present an interactive...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the VLDB Endowment

سال: 2010

ISSN: 2150-8097

DOI: 10.14778/1920841.1920931